ON THE INTERNAL DISTANCE IN THE INTERLACEMENT SET 
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Abstract. We prove a shape theorem for the internal (graph) distance on the 
interlacement set I u of the random interlacement model on Z d , d > 3. We provide 
large deviation estimates for the internal distance of distant points in this set, and 
use these estimates to study the internal distance on the range of a simple random 
walk on a discrete torus. 



1. Introduction and the results 

5_i . We study properties of the interlacement set X" of the random interlacement 

model. We are mainly interested in its connectivity properties, in particular in 
the internal distance (sometimes called the chemical distance) on the interlacement 
cluster. 

The random interlacement model was introduced in [SznlO] in order to describe 
the microscopic structure in the bulk which arises when studying the disconnection 
time of a discrete cylinder or the vacant set of random walk on a discrete torus. It 
can be informally described as a dependent site percolation on Z d , d > 3, which is 
'generated' by a Poisson cloud of independent simple random walks whose intensity 
is driven by a non-negative multiplicative parameter u. The set covered by these 
random walks is called the interlacement set at level u and is denoted by X". As the 
precise definition of X" is rather lengthy, we postpone it to Section [2] and state our 
results first. 

Let Pg = P[ • | G X"] be the conditional distribution given that the origin is in 
the interlacement set X u . For x, y G X u we define p u (x, y) to be the internal distance 
between x and y within the interlacement set X": 

p u (x, y) = min{ra : there exist Xq, X\, . . . ,x n G X u such that Xo = x, x n = y, 

and \\xk — Xk-i\\l = 1 f° r all A; = 1, ... , n}, 

where || • ||i denotes the £i-norm in Z d . As we shall see below, the set X u is P-a.s. 
connected for all u, so p u (x,y) < oo for all u > and x,y G X". Assuming that 
x G I u , let A u (x, n) = {y G X" : p u (x, y) < n} be the ball centred at x with radius n 
in the internal distance. We abbreviate A u (n) := A w (0, n). 

The first main result of this paper is the shape theorem for large balls in the 
internal distance. 

Theorem 1.1. For every u > and d > 3 there exists a compact convex set D u C M d 
such that for any e > there exists a F^-a.s. finite random variable N such that 

((1 - e)nD u n X u ) C A u (n) C (1 + e)nD u 

for all n > N. 
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Remark 1.2. Clearly, the set D u is symmetric under rotations and reflections of Z d 
and D u C {x G M. : ||x||i < 1} for all u. It is straightforward to show that 
D u — > {x G M. d : ||x||i < 1} as u — > oo; it would be interesting, however, to be 
able to say something about the behaviour of D u when u — > (e.g., does the shape 
become close to the Euclidean ball, and what can be said about the size of D u as 
u ->• 0?). 

The key technical step in the proof of Theorem 11.11 is a fact (which is of indepen- 
dent interest) that the distance within the interlacement cluster should typically be 
of the same order as the usual distance. 

Theorem 1.3. For every u > and d > 3 there exist constants C, C < oo and 
5 G (0, 1) such that 

F^there exists x G X u H [-n,n] d such that p u (0,x) > Cn] < C e' n& . 

A corresponding result for the Bernoulli percolation on Z d was proved by Antal 
and Pisztora; in their case the constant 5 equals one and is optimal, see [A~P96, 
Theorem 1.1]. We did not try to optimise the constant 5 in Theorem 11.31 

Remark 1.4. It is trivial to replace Pq by P in Theorems 11.11 and ll.3[ To this end it 
is only necessary to extend p u (x, y) to all x, y G Z d by setting 

p u (x,y) = p u (x u ,y u ), 

where x u (respectively, y u ) is the closest point to x (respectively, y) on X u (one can 
choose the rule how ties are broken in any convenient translational-invariant way). 

The methods used to show Theorem 11.11 also imply the following result. 
Theorem 1.5. It holds that F[I U is connected for all u > 0] = 1. 

Previously it was known that for every fixed u > 0, the set X u is P-a.s. connected 
(see (2.21) in |Sznl0j ); the above theorem means that P-a.s. there are no 'excep- 
tional values' of the parameter u. Remark also that much more is known about the 
connectivity of X u for fixed u, see |PT1H, IRS12] . 

Theorems 11.11 and 11.31 indicate that the interlacement set X u looks at large scales 
very much like T, d . In the same direction, Rath and Sapozhnikov recently proved 
that the interlacement set X u percolates in slabs |RSllaj . and that random walk 
on X u is transient |RSllbj . 

Theorem 11.31 can be also used to answer a related question: 'How much the range 
of the random walk on the torus resembles the torus?' To this end we consider 
(Xfc)fc G pj to be a simple random walk on the discrete d- dimensional torus of size N, 
T% = (Z/iVZ) d , and write P N for its law when started from the uniform distribution. 
We let Zjy to denote the range of the random walk up to time uN d , 

Tn = {A , . . . , X[ uN dj }. 

Let Pn(x, y) be the minimal distance of x, y G within X^, defined similarly as p u , 
and let djsr(x,y) be their usual graph distance on the torus. 

Theorem 1.6. For large enough C and'y, we have 

lim P N \pn( x , y) < Cdjsfix, y) for all x, y G X^ such that d^ix, y) > In 7 N] = 1. 

JV— >oo 
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This theorem improves the result of Shellef [ShelOj . where a similar claim was 
proved for C growing very slowly with N using entirely different methods. More 
precisely, |ShelO] requires C = hv k ' N where ln^" is the fc-times iterated logarithm, 
k > 1 being arbitrary. On the other hand, Shellef needs 7 = 5d only; we do not 
have control on the size of this constant. 

The main difficulty of the paper stems in proving our results for d < 5, in par- 
ticular for d = 3. In fact, for d > 5 there is a rather simple argument, based on 
the results of |RSllbj . which shows Theorem 11.31 with 6 = 1, and which we sketch 
in the Appendix. This argument uses the fact that for d > 5 the random interlace- 
ment restricted to a thick-enough two-dimensional slab dominates in some sense the 
standard Bernoulli percolation, which allows an application of [AP96j . Heuristically, 
in large dimensions it is possible to construct 'long straight connections' within X u 
locally, independently of the connections in other places. 

It seems that this argument cannot be extended to d < 5. It is much harder 
to construct the straight connections locally in an independent manner. This we 
do in Section [HI where we dominate the internal distance between the origin and 
the point (n, 0, . . . , 0) by the sum of a sequence of random variables with a finite 
range of dependence and stretched exponential tails, cf. (16. lip below. To obtain 
the finite range of dependence, we should show that connections within a large box 
of size m can be constructed using less than 0(m d_2 ) random walk trajectories 
(which is the typical number of random walks intersecting this box; here and in 
the sequel we write f(m) = Q(g(m)) when for positive constants C\,C2 we have 
Cig(m) < f(m) < C2#(m) for all m). In fact, in Proposition 14.21 we will show that a 
'backbone' of X u in this box can be constructed using 0(m d ~ 2 ~ h ) trajectories only, 
h < 2/d. This also means that for every u > the interlacement set X u is 'largely 
supercritical', that is it remains locally connected, even when considerably thinned. 

The paper is organised as follows. After introducing the notation in Section |5J we 
collect in Section [3] some estimates on the hitting probabilities of sets and on the 
range of the simple random walk. Section H] contains the key technical result of this 
paper, Proposition 14.21 This proposition roughly states that all points in (a possibly 
thinned version of) the set X u within box of size n are at internal distance n 2 , with 
a very high probability. Using this proposition, in Section |5l we give a short proof 
of Theorem 11.51 Sections [6H8] contain the proofs of Theorems 11.31 QUI an d 11.61 

Acknowledgements. The authors would like to thank Augusto Teixeira for many 
useful discussions, and Balazs Rath for pointing out Shellef 's paper |Shel0j . The 
work of Serguei Popov was partially supported by CNPq (301644/2011-0) and 
FAPESP (2009/52379-8). 

2. Preliminaries 

In this section we fix the notation and recall the definition of the random inter- 
lacement model. 

Let N = {0, 1, . . . } be the set of natural numbers. We denote with ei, . . . , the 
coordinate vectors in Z d , and write || • ||, || • ||i, || • for the Euclidean, and 
norms correspondingly. We use B(x, r) to denote the closed || • Hoo-ball centred at x 
with radius r, and abbreviate B(r) := -8(0, r). We say that A C Z d is connected if for 
any x, y G A there is a nearest-neighbor path that lies fully inside A and connects x 
to y. We write \A\ for the cardinality of A, diam(A) = max XiS/eJ 4 \\x — yWoo for its 
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diameter in ^-norm, and OA = {x G A : By G A c , \\x — y\\ = 1} for its internal 
boundary. 

Let us write P x for the law of a discrete-time simple random walk (X n ) ng ^ ori Z d 
started from x. For A C Z d we denote with Ha-, Ha and Ta the entrance time in A, 
the hitting time of A, and the exit time from A: 

H A = inf{n > : X n G A}, 

Ha = inf{n > 1 : X n G A}, (2.1) 
T A = inf{n > : X n $ A}. 

Given A C 1* d finite, we define the equilibrium measure of A by 

e A (x) = P X [H A = oo]l A (x) 

and denote by cap(A) = Y^ x eA e A{ x ) its total mass. 

We now recall the definition of the random interlacement from |SznlOj . In order 
to do this we need to introduce another notation which is, however, mostly used 
only locally. Let W be the space of doubly-infinite nearest-neighbour trajectories 
in Z d which tend to infinity at positive and negative infinite times, and let W* be the 
space of equivalence classes of trajectories in W modulo time-shift. (These spaces 
are equipped with cr-algebras W, W* as in (1.2), (1.10) of [SznlOj.) The random 
interlacement is defined via a Poisson point process taking values in the space Q of 
point measures on the space W* x [0, oo) with the intensity measure v ® du. We 
denote by P the law of this process. 

To describe the measure v appearing in the intensity of the Poisson point process, 
for A C Z d , u > 0, we denote by n A the mapping from Q to the space of point 
measures on W which selects from u G £1 the trajectories with labels smaller than u 
intersecting A and parametrises them so that they enter A at time 0. Formally, for 
u = J2i>o S (wt,Ui) G f2 , w* G W*, u % > 0, we define 

/4M = X) <j mK) 1 { Rmi «) n a ^ 0, Mi < u}, (2.2) 

where Ran(w*) = Unez u; ( n ) ^ or an arbitrary w in the equivalence class of w*, and 
sa{w*) is the unique w G W in this equivalence class such that wo G A, W- n £ A, 
n > 0. As follows from [SznlOj . Theorem 1.1, the measure v is uniquely determined 
by the following two properties which we will frequently use: 

• For every finite set A C Z d , under P, the number r] A := fi A (u)(W) of trajec- 
tories in oo with labels smaller than u entering A has the Poisson distribution 
with parameter Mcap(A). 

• Let /i^(w) = z2i=i^tiiiJ w i e W. Then, under P, Wi are i.i.d., independent of 
f] A , with the law given by 

%.W) n >o^] = E#4f]. 

for any measurable set F in the space of single-infinite nearest-neighbour 
paths. It means that w iy restricted to non- negative times, are i.i.d. simple 
random walk trajectories started from the normalised equilibrium measure 
e A {-)/e A {A). 
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The interlacement set at level u is then defined as the trace of all trajectories in u 
with labels smaller than u, 

X u (u) = \jRtm(w*)l{ui < u}. 

i>0 

We now explain the conventions for the use of constants in this paper. We denote 
by C, Ci, C[, C 2 , . . . the 'global' constants, that is, those that are used all along the 
paper and by c, c', ci, c 2 , c 3 , . . . the 'local' constants, that is, those that are used only 
in the small neighbourhood of the place where they appear for the first time. For the 
local constants, we restart the numeration either in the beginning of each subsection 
or in the beginning of each long proof. All these constants are positive and finite and 
may depend on dimension, it, and other quantities that are supposed to be fixed; 
usually we omit expressions like 'there exist positive constants ci,C2 such that . . . ' 
and just directly insert c's to the formulas. 

Also, the reader will notice that very frequently in this paper the probability of 
events (indexed by some integer parameter, say, n) will happen to be bounded from 
above by e~ cn or from below by 1 — e~ cn , where S is typically (but not necessarily) 
between and 1. So, we decided to use the following definition: 

Definition 2.1. We say that f(n) is s.e. -small (where s.e. stands for 'stretched- 
exponentially') if for all n > 1 it holds that 

0<f(n)< Cl e- c * nC \ 

and write f(n) = s.t(n). 

Observe that n c s.z.{n) = s.z.(n) for any fixed c > 0. So, it is quite convenient to 
use this notation e.g. in the following situation: assume that we have at most n c 
events, each of probability bounded from above by s.e.(n). Then, the probability of 
their union is s.e. (n) as well. 

3. Estimates on hitting probabilities 

In this section we collect several estimates on hitting probabilities of subsets of Z d 
by random walk trajectories. We recall that P x denotes the law of the simple random 
walk (X n )„ e N in Z d , d > 3, starting at x. We denote by g the 'stopped' Green 
function: 

n 

g(x,y;n) = ^P x [X k = y], 

k=0 

and write g(x,y) for g(x,y; oo). For the case d > 3 it holds that g(x,y) is finite for 
all x, y 6 7, d , g(x, y; n) = g(y, x; n) = g(0, y — x;n), and, for all n > \\x — y\\ 2 

C 

g(x,y;n)>-— — 1 2 , (3.1) 
1 + \\x — y\\ a z 

g(x,y)< C ' x , (3.2) 
1 + ||x — y\ a 1 

for all x, y G T, d . The upper bound ( 13. 2p follows directly from Theorem 1.5.4 
of |Law91j . The lower bound (13. ip can be proved easily adapting the proof of the 
same theorem. 

For n > 0,x G Z d , A C Z d , let 

q x (A;n) = P X [H A < n] 
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be the probability that, starting from x, the simple random walk enters A before 
time n. We use the abbreviation q x (y;n) := q x {{y}',n) for the hitting probabilities 
of one-point sets, and q x {A) := q x (A; oo) for the probability that the simple random 
walk ever enters the set A. It is elementary to obtain that for all x, y G Z d and 
n > \\x - y\\ 2 (see e.g. Theorem 2.2 of |AMP02| ) 

1 + \\x — y\\ a 2 

11 c , y " (3.3) 

qM ~ l + \\x-y\\^ 
Next, for iGZ d and a finite set A C Z d , define 

g(x,A;n) = ^2g(x,y;n). 

yeA 

Clearly, g(x, A; n) is the expected number of visits to A up to time n, starting from x. 
As before, we set g(x, A) := g(x, A; oo). 

The following lemma will be used repeatedly to estimate the hitting probabilities: 

Lemma 3.1. For all x G Z d , finite A C Z d , and < n < oo 

q(x, A:n) . . . o(x, A) 

< &(A;n) < , 1 '/ .. . (3.4) 



max^eA g(y, A; n) "' ' ~ mm yeA g(y,A)' 
Proof. Using the definition of g and the strong Markov property, 

g{x, A) = J2 p 4Ha < oo, X Ha = y}g(y, A) 

> min g(y, A) y2P x [H A < oo,X Ha = y]. 

yeA ■ ' 

y eA 

Since q x (A;n) < q x {A) = Y. y eA P A H A < oo,X Ha = y], the second inequality 
in (13. 4 p follows. The first inequality is then implied by 

n 

g(x, A;n) = ^Y^ P x[ h a = k, X Ha = y}g(y, A;n-k) 

k=0 yeA 
n 

<J2J2 p *[ Ha = k > Xh * = yteiv, A ~> n ) 

k=0 yeA 

n 

< maxg(y, A; n) V] Y] P X [H A = k, X Ha = y], 

yeA ' 4 ' 4 

k=l yeA 

together with q x (A; n) = Y2=o Y. y eA P x[Ha = k, X Ha = y\. □ 

Let us use the notation £(x,A) = max^^ ||x — yW^ for the maximal distance 
between x and the points of A. Two following simple lemmas contain lower bounds 
on hitting probabilities of sets. 
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Lemma 3.2. Suppose that A is a connected finite subset of Z> d , containing at least 
two sites. Then, for all x G Z d and n > (£(x, A)) 2 , 

C 3 diam(A) 



q x (A; n) > 



(£(x,A)) rf - 2 lndiam(A)' 
C 3 diam(A) 



d = 3, 
d > 4. 



{£{x,A)) d - 27 

Proof. Since A is connected, it is possible to find (not necessarily connected) set 
A' C A with the following properties: 

• \A'\= diam(A), 

• one can represent A' = {xi, . . . , £diam(A)} in such a way that — Xj\\oo > 
\i — j\ for all i, j = 1, . . . , diam(A). 

Indeed, it holds that the size of the projection of A on one of the coordinate axes is at 
least diam(A) and this projection is an interval; then, for all points in the projection 
pick exactly one element of A that projects there, and erase 'unnecessary' points 
of A. Then, by ( 13. ip we have for any n > {£{x, A)) 2 

a(x A '. n ) > C.dmmjA) 
9[X ' A ' n) - l + (£(x,A)Y- 2 > 

and, by f 13 . 2 1) . for any y G A', 

diam(A) . ( . » 

y 2fi klndm(4 d = 3, 



Since ^(A; n) > q x (A'; n) for all n, the claim follows from Lemma [3. II □ 

The previous lemma works well for sparse connected sets. For more densely packed 
sets we need another estimate: 

Lemma 3.3. For all x G r L d , finite A C Z d containing at least two sites, and all 
n>(£(x,A)) 2 , 

q ^ H) ^ {£{x,A)Y- 2 - 
Proof. Again using ( 13. ip . we have for any n > (£(x, A)) 2 

To obtain an upper bound on g(y, A; n) for y G A, we observe that 

\{xeZ d : \\x\\ e[k,k + l)}| = e^- 1 ). 
So, using ( I3.2p . we have for y G A 

g(y, A;n) <Y^ - „ Cl ... - 

zeA 113 11 



fc=0 

< c 5 |^| 2/rf 



M-2 

fc=0 " ' 
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where we have used an obvious worst-case estimate (all the points of A are grouped 
around y, forming roughly a ball of radius 0(|v4| 1//a! )) on the passage from the first 
to the second line of the above display. Then, applying Lemma 13.11 we conclude the 
proof of Lemma 13.31 □ 

We end this section by stating a few well-known facts about the behavior of the 
set of sites visited by a simple random walk by time n. As we could not locate 
suitable references, we also sketch their proofs. 

Lemma 3.4. Suppose that d > 3 and let R(n) = {X , . . . ,X n } be the set of sites 
visited by a simple random walk by time n. Then, for any fixed ati G (0, 1), 

Pfn 1 ^ 1 < diam( J R(n 2 )) < n 1+a \ \R{n 2 )\ > n 2 ~ 2ai ] > 1 -s.e.(n). 

Proof. The upper bound on the diameter follows from any convenient large deviation 
bound on the displacement of the simple random walk (e.g. Lemma 1.5.1 of | Law91j ). 

To control the diameter and the number of visited sites from below, we use the 
following simple argument: We divide the temporal interval [0,n 2 ] into c~ l n 2ai 
subintervals of length cn 2 ~ 2ai , for a large enough c. Clearly, on each of the subin- 
tervals of length cn 2 ~ 2ai the maximal displacement of the simple random walk is at 
least n l ~ ai with a constant probability, e.g., by the central limit theorem. Noting 
that by time k the number of visited sites is at most k, and that the expectation of 
this number is at least c'k (it is straightforward to obtain this from (13. ip ). we deduce 
that also with at least constant probability the number of different sites visited by 
the random walk during a fixed temporal interval of length cn 2 ~ 2otl is at least n 2 ~ 2otl 
(if c is large enough). Finally, to estimate the probability that the event of interest 
occurs on at least one of the c~ l n 2ctl subintervals, use the independence. The claim 
then follows easily. □ 

We also need an estimate on the number of different sites visited by several random 
walks: 

Lemma 3.5. Consider k independent simple random walks (X,- )j>o, ■ ■ ■ , (^j)j>o 

started from arbitrary points x^\ . . . ,x^ k \ and denote Rj{m) = {Xq\ . . . ,Xm}, 
j = 1, . . . , k. Assume that n h < k < n d ~ 2 for some fixed h e (0, d — 2). Then, for 
any a 3 G (0, h) we have 

k 

P [| |J Rj{n 2 ) > kn 2 - a3 > 1 - s.e.(n). 

i=i 

Proof. We use a similar argument as in the previous proof. We divide the k walks 
into c _1 n a3 groups, each containing ckn~ as walks. Consider now the ckn~ a3 walks 
of the, say, first group, suppose that they are labelled from 1 to ckn~ az . Let 

ckn- a 3 

V= |J R {n 2 ) 
i=i 

be the set of sites visited by the walks from the first group. For y G Z d , define 

ckn~ a 'i 

C(y)= Yl l{lk a) -2/|l<n} 

3=1 

1 For any random variable £ with < £ < a a.s. and > b, it is true that P[£ > b/2] > b/ (2a). 
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to be the number of walks of the first group that start at distance at most n from y. 
By (I3.3p . using < k < n d ~ 2 , we have 

P[yeV] = l- k \{ (l-q x{]) {y-n*))> C ' ay) 

3=1 

So, if c is large enough 

e\v\ = P[y^v} 



n d-2 



> 



d 



n d-2 



c'ckn " 3 , , 

^ n d-2 \{v- \\v\\< n }\ 

> 2kn 2 ~ a3 . 

Since, trivially, \V\ < ckn 2 ~ a:i , it holds that \V\ > kn 2 ^ a3 with at least a constant 
probability. As the same reasoning applies to each of the c~ l n az groups, the claim 
of the lemma follows by independence. □ 



4. Intersections of random walks 

In this section we show that the set of points visited by sufficiently many walks 
started in B(n) is typically well connected; the precise statement of this fact is 
contained in Proposition 14.21 

To state this proposition we need some notation. We consider two sequences of 
positive random variables ?7i satisfying f)i < fj^ 1 an d 

P[^ n) > C 5 n d - 2 - h ] > 1 - s.c.(n), (4.1) 

P[^ n) < C 6 n M ] > 1 - s.c.(n), (4.2) 

for some h < | and M > 0. Let . . . , X^ 2 ) be fj^ independent simple random 
walks starting from some sites . . . ,x^ 2 ) G B(n). We write P for the joint 
distribution of these walks. Let -Rfe(m) = {X^ k \ . . . ,Xm^} be the set of different 
sites visited by kth random walk until time m. We write H\ for the entrance 
and hitting time of A by random walk X^ (recall (12. ip ). 

Definition 4.1. For integers s,m > 1 we say that X® is (s, m)-connected to X^> 
if there exist a sequence of integers i = k , ki, . . . , k s = j such that 

kt < Vi > for allt = 0, . . . , s 

Rk t ijn) fl -Rfc t „i (m) 7^ 0, for allt = 1, . . . , s. 

(We do not indicate the dependence on n in order to keep the notations not too 
heavy.) 

In words, the definition says that the trajectories are (s, m)-connected if one 
can go from the starting point of the ith trajectory to the starting point of the jth 
trajectory within the cluster of the first fj^ trajectories, by changing no more than s 
times the trajectory, and using at most m sites in the beginning of each trajectory. 
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Let us define for k < i)^ the following set of integers: 

r(k) 

l "3(m+l)n 2 



L k = {m > 1 : {xWp, . . . H B(n) ? 0}, 



and let 

JC»> = {A; < : \L k \ = 0} (4.3) 
be the index set of the walks that do not come back to £?(n) after the time 3n 2 . 



For d > 3 and h < |, define 



fc) := min jfc > 1 : ^ + (d - 3 + h - y ) (l - * * < 1 } (4.4) 

(in fact, this quantity represents the necessary number of steps in the recursive 
construction used in the proof of Proposition I4.2[ see ( 14. 8 p and (|4.15J1 : at this point 
we only observe that (3(d,h) is finite since 4^ < 1). 

The following proposition plays the key role in this paper: 

Proposition 4.2. Let ff[\ fj^ and X^, k < fj^ > be as above. Then 

P[ViJ < andX {j) are (2/3(d,h) + 1, 2n 2 ) -connected] > l-s.t.(n). (4.5) 

Moreover, 

P[Vi < 3j < r?i n) such that R^n 2 ) n Rj(2n 2 ) ^ 0] > 1 - s.e.(n), (4.6) 

and 

P [Vi < 172"^ Vm G Li 3j < f)i^ such that 

{*2L» • • • ' 4Ui)n 2 -J H R 3 (2n 2 ) * 0] > 1 - «t(n). (4 ' T) 

Remark 4.3. (a) The estimates in the above proposition only depend on the number 
of walks that we consider, they are uniform with respect to the choice of the starting 
positions. 

(b) Typically, when applying Proposition 14.21 to the interlacement set (say, in the 
ball B(n)), the variables fj^, 773 wm be of order n d ~ 2 , so that h = 0. The proposi- 
tion implies that the model of random interlacements is 'far from the criticality' with 
respect to the connectedness of the interlacement cluster; we typically need much 
less than Q(n d ~ 2 ) walks to ensure that the interlacement set is 'well connected'. 

(c) In the most important case h — 0, it holds that /3(3, 0) = l,/3(4,0) = 
2, ,3(5,0) = 3,^(6,0) = 4, but then 0(7,0) = 6. Comparing this with the re- 
sults of |R S12} IPT11] (where it is proved that every two points in X u can be joined 
by a path switching the trajectory at most (|~d/2] — l)-times) indicates that the 
constants f3{d,h) are not optimal. The authors did not check if the formula (I4.4p 
can be further simplified, but it is clear that (3(d, h) = Q(dlnd) as d — > 00. In any 
case, for our needs it is enough to know that (3(d, h) is finite for any d > 3 and 
h < 2/d, and this fact is quite obvious. 

First, let us describe informally the idea of the proof for the particular case 
h = (one may note that there are many similarities with the proof of Theo- 
rem 3.2 of [AMP02J, and with techniques used in |RS12] ). Consider the random 
walk X^ and run it up to time n 2 . Then, diam(i? 1 (ra 2 )) is typically of order n, so 
any other random walk X^ hits the set R\{n 2 ) with probability at least of order 
roughly n~( d ~ 3 ) (with logarithmic correction for d = 3) by Lemma 13.21 Since there 
are ®{n d ~ 2 ) other available walks, with high probability Ri(n 2 ) will be hit by 0(n) 
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Figure 1 . On the proof of Proposition 14.21 for d — 4. One considers 
first the trajectory of some particle (labelled here by '1') up to time n 2 . 
The trajectories of particles labelled by '2' meet the first trajectory 
(the small boxes indicate the corresponding places of first hitting). 
The particles labelled by '3' then hit the set formed by the trajectories 
with labels '1' and '2'. Continuing the trajectories of '3'-particles a bit 
more, one finds a 'generic' trajectory (the thick one on this picture) 
with very high probability. 

different other walks. In dimension d = 3, running these Q(n) walks for n 2 time 
units more after the respective hitting moments of R\{n 2 ) is already enough to meet 
all the other trajectories (again applying Lemma l3T2| one obtains that the probabil- 
ity that any other trajectory hits none of those walks is almost exponentially small 
in n). In dimension d > 4 this argument, however, just barely does not work. 

So, what to do in dimension 4? Consider those Q(n) trajectories (of length n 2 ) 
that intersect the initial one. Together with the initial trajectory, they form a 
connected set of cardinality roughly n 3 . We then apply Lemma [3.31 to obtain that a 
random walk starting somewhere at the boundary of B(n) will hit such a set with 
probability at least of order n~ 2 x n 3 ^ 1 ^^). Since (recall that now d = 4) we have 
Q(n 2 ) walks in total, typically ©(n 3 *- 1- ^)) of them will hit that set. Since in four 
dimensions Lemma 13.21 gives lower bound of order n _1 for the hitting probability 
of the initial piece of length n 2 of a generic trajectory and 3(1 — |) = | > 1, 
running these 0(n 3 / 2 ) walks a bit more we meet all the other trajectories with high 
probability (see on Figure [1] an illustration of the proof for d — 4). 

Again, in dimension d = 5 this fails since Lemma 13.21 now gives a lower bound 
of order n~ 2 . However, iterating the above construction, we then obtain roughly 
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6( y2+3(i-§))(i-§)) independent walks, and, since (2 + 3(1 - |))(1 - §) = § > 2, 
these are enough to detect all the other walks. For any fixed d one can perform 
enough iterations to make this construction work. 
If we recursively define the sequence 

a[ d) = l, a& = (fl« + 2)(l-H), (4.8) 
then the necessary number of iterations /3(d, 0) can be calculated as follows: 

0(d, 0) = min{£; : af ] > d - 3}. 
Since it is straightforward to obtain from the recursion (I4.8P that 

a^ = d-2-(d-3)(l-^y~\ 

we see that the above definition of f3(d, 0) agrees to (14.41) . 

In order to make the above argument rigorous, we have to address several issues, 
for example: 

• Deal with the dependence of the walks that participate in different stages 
of the above construction. This can be done by dividing the walks we use 
into p(d, h) groups and use one group on each stage. 

• In fact, the trajectories can go back to the ball B{n) at later epochs (i.e., 
much later then n 2 ). To prove (14.71) . we have to assure that the random 
walks constructed on the (3(d, h)th stage would meet these pieces of the 
trajectories too, otherwise we would have no good control on the distance 
within the interlacement cluster. So, we have to control the 'total number of 
returns' (see (14. lip below). In addition, in the above construction we shall 
use only the walks conditioned on not returning to B[n) after time 3n 2 (in 
order not to be obliged to condition on a too much detailed future behaviour 
of the trajectory). 

• Finally, all the events described in the informal construction should not only 
be 'typical' in some sense, but hold with probability at least 1 — s.e.(n). For 
that, we need to 'adjust' (by sufficiently small amounts) the values in the 
power of n on each stage. 

Proof of Proposition We start with the formal proof of Proposition 14.21 To 



simplify the notation we write (3 = /3(d, h). Recall (14.31) and define for m = 1, . . . , /3 

Since, clearly, there is a constant C4 > such that for all x G B(n) we have 

P x [X m i B(n) for all m > n 2 } > c 4 , (4.9) 

we obtain that 

^[l^l ^ c 5 n d - 2 - h for all m = 1, . . .,0\ > 1 -s.e.(n). (4.10) 

Inequality (14. 9p further implies that that for every k, m > 1 

P[|L fc | > m] < e~ C7m . (4.11) 

In the sequel, we will repeatedly use the following observation. For a simple 
random walk X, let X[ 0) 2n 2 ] be the piece of trajectory of the walk X up to time In 2 . 
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Then there is a constant eg > such that for any event A which depends only on 
the initial piece of the trajectory of length 2n 2 

P[X [0j2n2] e A | X m B{n) for all m > 3n 2 ] > c 8 P[X [0j2n2] G A]. (4.12) 

Indeed, to prove (14.121) . we write 

^P[^[o,2n2] G A | X m ^ B{n) for all m > 3n 2 } 

> P[X [0; 2n2] G A, X m ^ B{n) for all m > 3n 2 ] 

= P[X [0j2n2] G A]P[X m i B(n) for all m > 3n 2 | X [0)2 „2] G A], 

> P[Xr 2n2i G A] inf P,; X m 4: B(n) for all m> n 2 

x£Z d l 

and use (14. 9 p to argue that the last term is at least of constant order. 



As a last preparatory observation, note that, for any e > 0, by Lemma [3.41 and 
the observation following Definition I2.1[ 

for all k < fj^\diam(R k (n 2 )) > n l ~ £ , 1 > i _ (\ (a-io\ 

diam({X« n2 , . . . iX to +1)na _J) > n 1 - for all m G L k \ ~ ^ ^ 

Let %\ = min{j : j G J±}, and define 

^ = P n (n 2 AT- (2n) ). 

For any j we obtain using Lemma l3.2[ and (I4.12p with A = {Rj(n 2 ) nV\ ^ 0}, 

c 9 diam(Vi) _ 

p[p,(™ 2 ) n y x ^ | j g jj n) ] > <j (4.14) 

C9di 7f l} , d > 4. 

We introduce the set of indices K x = {j G j[ n} \ {h} : Rj{n 2 ) n Vi ^ 0}. By fOT)]) . 
(14.131) . and (I4.14p . using the independence the random walks X^\ it holds that 

P[\K X \ > n^ h - £l ] > 1 -s.z.(n), 

where E\ := 2e (e is supposed to be sufficiently small so that 1 — h — E\ > 0). 

For g? = 3, everything is ready to finish the proof of Proposition 14. 2\ but for other 
values of d we first need to describe a general step of the construction (recall that f3 
steps are necessary). Define recursively (recall (14. 8p ) 

a\ = ai(d, h) = 1 — h, 

( 2\ (4.15) 
a n +i = a n+ i(d, h) = (a n (d, h) + 2) [I - - j - h. 

From the above recursion it is straightforward to obtain that 

dh ( , , dh\ ( 2\ 

So, with (3 defined by (I4.4p . it holds that dp > d — 3. 

Assume that for some 1 < m < (3 — lwe have constructed the connected sets V m C 
Z d and also the sets K m C Jm^ of indices of the walks which hit V m before time n 2 , 
such that with probability at least 1 — s.z. (n) 

\K m \>n am - £m . (4.16) 
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Then, define 

V m+ i = V m u( |J R 3 (2n^V B{2n) )). 

By Lemma (observe that, by (|4.12|) . its proof still goes through in this situation) 
and (jUSD it holds that 

P [\V m+ i | > n 2+am - 2£m ] > 1 -fit(n). (4.17) 

Define K m+ % = {j G J^+i '■ Rj(n 2 ) fl V m ^ 0}. Observe that, by Lemma 13.31 
and dH2D with A = {R(n 2 ) n U m+1 ^ 0}, for any j G 

P[R 3 (n 2 ) n K„+i ^ 0] > 01 ( 4 - 18 ) 
So, using ( QUjl . (Q3j) . f HTTjl . and IBIS| . we obtain 

P[\K m+1 \ > n (2+«m-2em)(l-l)-A- e m] > J _ ^ , 



77, 



and (for the next induction step) denote e m+ \ = a m+ i — (2 + a m — 2e m )(l — |) + s m , 
so that (14.1 6p would hold with m + 1 instead of m. 

Now we describe the last step needed for the proof of (14. 5p . (14.61) . and ( 14. 7p . 
Assume that on the initial step the parameter e was chosen to be so small that 
dp — Bp > d — 3 + e. Consider the walks with indices in Kg\ after hitting Vp the rest 
of the trajectory is conditionally independent from the initial part, so Lemma [3.21 
and (14.121) imply that, for j G Kg 



d 



[X (3 ] ,...,X {j ] AnA^ 



> 



In n 

d > 4, 



n d-3 



for any connected set A C B(2n) such that diam(A) > n 1 e . Using this together 
with (14. lip and f !4.13j> . we conclude the proof of Proposition 14.21 □ 

5. Proof of Theorem 11.51 

Using Proposition I4.2[ it is straightforward to show Theorem 11.51 Denote by 
T]u the number of trajectories at level u entering B(n), that is the number of 
trajectories in the support fi^(n) ( recan (12. 2p for the notation). By the definition of 

random interlacement, r]u has Poisson distribution with parameter ucap(B(n)) = 
®(un d ~ 2 ). Therefore, using e.g. Chernoff bounds we obtain for small enough c\ and 
large enough c 2 that 

F[ Cl un d - 2 < r^ n) < c 2 un d - 2 ] > 1 - s.t.(n). (5.1) 
Assume for the moment that 

Pfthere exists u > such that X u is not connected] > 0. (5-2) 
Then, one can find e > and < u < u < oo such that 

P[there exists v! G [u, u] such that X u is not connected] > e, 
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and so, denoting by Xfs := U,-.,„ c„,__ Ranu>j the interlacement set generated 
by the trajectories that intersect B(n), we have 

liminfP [there exists it' G [«,«] such that Z/i^ is not connected] > e. 

This, however, contradicts Proposition 14.21 putting = 77^, 7)2 = % > an< ^ 
using (15. ip . we see that, if both events in the left-hand sides of (14.51) and (14.61) 
occur, then Tts should be connected for all u' G [u,u]\ on the other hand, the 
probability of these events approaches 1 as n — )■ oo. So, (15. 2p cannot be true. □ 

6. Large deviations for the internal distance 

In this section we prove Theorem 11.31 To this end we fix a G (0, 1/3) and investi- 
gate the properties of X u when restricted to 

n 

In words, G^ is the n a -neighbourhood of the segment between the origin and n&\ 
(recall that B(x,r) denotes the ball in the || • ||oo-distance). 
First, we need the following elementary estimate on e A). 

Lemma 6.1. Let be the hyperplane {x G Z d : x ■ e x = k}. Then, for any 
k G {- [n a \ + 1, . . . , [n + n a \ - I}, it holds that 

( Qfi a (d—3) 

e Q (n)(F k ) = P x [H G (n) = oo] > 

xedG^nF k " \cn a(d - 3 \ d > 4. 



. d 

In n 



Proof. We adapt the proof of Proposition 2.4.5 of |Law91] . Let G n = ([—An, An] x 
[— n a , n a } d ~ l ) fl 7L d . As G n — £e\ D Ga — hex for any k as in the statement and 
£e {-2n,...,2n}, 

P z [H Gi ^ = oc}> J2 PAHc n = oo]. (6.1) 

zedG^nFk zedG n nF e 

Let W = G n fl {x : \x • ei| > 2n}. It is elementary to see that P [H W = oo] > c. 
Inspecting the proof of |Law91| Proposition 2.4.1(c)], denoting by L the last time 
the random walk visits G n , we get 
c < Pq [H w = oo] 
<Pv[\X L -e l \<2n] 

g(0,z)P z [H Gn = oo] 

z£dG n ,\z-ei\<2n 
2n 

< sup g(0,z) P AH Gn = oo]. 

i=-2n^dG n nF e zedGn nF e 

Using ( 16. ip . we then get 

2n 

c< P ^ Gn = oo] Y sup g(0,z). 

zedG^nF, e= _ 2n zcdG n nF e 



ON THE INTERNAL DISTANCE IN THE INTERLACEMENT SET 16 

The lemma then follows using the elementary asymptotics g(0, z) x ||2;|| 2_d (re- 
call flZH HP )- D 

Let r] n = r) n (u, a) be the number of trajectories of \i u {n) (recall (12. 2p ). As before, we 

enumerate the corresponding random walks as . . . , X^ n \ denote their starting 
positions by x^\ . . . , x^ n \ and let Rk(m) be the set of different sites visited by kth 
random walk by time m. 
Let us define 

U k = B([kn a \e 1 ,n a ) = {yeG^:\ye 1 -[kn a \\<n a }, fe = 0,...,n. (6.2) 
Due to Lemma EIU for any k G {0, . . . , n}, 

? cn a(d-2) 
Inn 
cn a{d - 2 \ d>4. 



d = 3, 



Let r] nk = \{i < r] n : G £4}| be the number of walks starting in \J k . Using 
the large deviation properties of the Poisson distribution, as in (|5.1|) . we obtain 

P[ci/ d (n) < r]n,k < c 2 f d {n)] > 1 - st.{n). (6.3) 
We now fix a small positive constant e > 0, and define for 1 < k < i] n 
3u = inf{j > : X% +s)+{j _ 1)n2a+ . £ GW V< = 0, . . . ,n 2a }, 
4 = n 2 ^+j fc n 2a , 

** = inf{j > : diam( J R fc (j)) > 2n 2(a+e) } A i k . 

Denote by X (respectively, X) the 'interlacement' set formed only by the initial pieces 
of length tk (respectively, tk) of the trajectories (X^, k = 1, . . . , i] n ): 

11„ Tin 

x = [J Rk(ik), i = [J Rk{ik)- 

k=l k=l 

Observe that I" D I D I. Further, by the central limit theorem, for any x G G a ^ , 
P X [T (n) < n 2a ] > c. Therefore, using the strong Markov property recursively on the 
definition of jj., 

> n £ ] < s.t(n). 
When jk < n £ , then diam(i?fc(£fc)) < 2n 2<ya+£ \ Therefore, 

P[X^X] < s.t.(n). (6.4) 

Heuristically, the set X, is 'well suited' for application of Proposition 14.21 as it has 
no 'dangling ends' in ■ By this we mean that knowing that X^ is in Ga £L"t 
some time j, its next n 2a steps will be contained in I: 

{Xf } G Gjfij < 4} =► {i k > J + n 2a }. 

On the other hand, the trajectories in X are 'short range', which will introduce some 
independence later. 

We now introduce a notation that will be useful many times, see Figure [2] for its 
illustration. For x G Z d \ {0} and y G Z d , we define 

Co X \y) — max{m < : mx + y G X"}, 
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Ci%(y) = min{m > (^{y) :mx + yeX u }, k > 0. 

We set ij) k {y) '■— y + (Ck X \y)) x to be the site on X u corresponding to Q (y). When 
y = and/or x = ex, we omit them from the notation, that is e.g. (k '■= (i^i®)- 




Figure 2. Construction of Ct\v) and ^t\y)- Here Ct\v) = 
([ x \y) = 2, &\y)= 3 and (j?\y) = 5. 

As the first step of the proof of Theorem II .3[ we control the distance between the 
left- and right-most intersection of X u with the segment (0, nei] . More precisely, we 
want to show that 

P[p„Oi, ifo{nei)) > en] < s.z.(n) (6.5) 
for a c large. (Observe that Pg^o = 0] = 1.) 

We write p(x, y) for the internal distance of x, y on X and define ^^(y) similarly 
as ip\ {y), using X instead of X u . Observe that ^\ (y) depends on n. It may happen 
that ipi = ipx, ipo{ ne i) — ^oi^i)) but that is not certain. In any case, 

Pu(i/>i, ^o(nei)) < p u {i>i, ipi) + p{i>i, ipa(nei)) + p u (ipa(nei) , i/j (nei)). (6.6) 
To bound the right-hand side, we need few lemmas. 

Lemma 6.2. Let gd(k) = e~ Cik when d>A, and g^(k) = e~ C4k / lnk . Then, for every 
a E (0, 1/3), e > 0, x E Z d \ {0}, y G Z d , 

V[\\4 X \y)\\oo>k]<g d (k), (6.7) 

Further, for y such that B(y,n a / 2 ) C Ga, 

P[||^ ei) (2/)lloc> k] <g d (k)+s.t(n). 

Proof. Let Sk = {y + jx : < j < k}. The first claim follows directly from the defi- 
nition of X u (observe that for any A C Z d it holds that F U [A HX U = 0] = e ~ uc ^ A) ) 
and the simple estimate on the capacity of the 'segment' Sk (see e.g. |Law91j . Propo- 
sition 2.4.5) 

'e(fc/ In k), d = 3, 
e(k), d>4. 



c&pS k 
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For the second statement, we assume without loss of generality that y = 0, and 
define A n = {0 < k < n £ , k even}. For every j G A n , and x G Uj, by Lemma [3.21 

l((j + l)n a ) rf " 2 ' rf - 4 ' 
Combining this estimate with ( I6.3p . using > n 2<ya+e \ we obtain in d = 3, 



P[||A||oo > k] <s.t(n)+ 



((j + l)n°) d - 2 lnfc 

i \ f c zk sr-^ 1 



\nn\nk ^—f (j + l) d 2 



< e- C4k/hlk +s.t.{n). 

For d > 4 the calculation is very similar. Actually, it is sufficient to consider only 
the term j = 0, as there are no logarithmic terms in the denominator. □ 

As a consequence of the last lemma and Proposition 14.21 we obtain, 

V\pu$iM < c'n 2a ] > 1 - s.c(n), (6.8) 

P|/>„(A(neO, ^o(nei)) < c'n 2a ] > 1 - s.c(n). (6.9) 

Indeed, obviously 

(6.10) 

< P[{Vi,Vi} £ 5(n a )] +P[{V>i,Vi} C BKj.ft^ij) > c'n 2a ]. 

The first term in the right-hand side is s.c.(n) by Lemma [6.21 For the second one, 
it suffices to set d = 3(2/3(h,d) + 1) and apply Proposition 14.21 to Uq (recall ( 16. 2p ) 
and fj^ = f}2 = Vn,o which satisfy the assumptions 04. ip and ( 14. 2 j) due to ( 16. 3p . 
Claim ( 16. 8 p then follows. The proof of (I6.9P is completely analogous. 

Similarly, applying Proposition 14.21 to the sequence of sets Uk, k = 0, . . . ,n, we 
obtain that 

P[X is connected] > 1 —s.z.(n). 

To bound the middle term on the right-hand side of (16.61) . we consider the sequence 
of random variables 



rpn 

1 k 



p(kei, ^i(fcei)), if kei G I, \\tpi(kei) — fcei|| < n" 
0, otherwise . 



It is clear that on the event {|| , ?/'i(fce 1 ) — fcei|| < n a : k = 0, ...,n}, which by 
Lemma [6.21 has probability 1 — s.c.(n), we have 

n 
fc=0 

To control the sum, we need a tail estimate on that is uniform in n. 
Lemma 6.3. For every k — 0, . . . , n, uniformly in k, we have 

F[T£>£] <s.t.(n)+s.t.(£). 
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Proof. Without loss of generality we consider k = only. First, we fix m < n a /2 
and control the number of trajectories entering B(m). We claim that 

P[|{i < rjn ■ H l B{m) < ti} > c 5 m d ~ 2 ] > 1 - s.e.(m). (6.12) 

To prove (I6.12p we use an argument similar to the proof of Lemma 16.21 We define 
A n — {0 < k < n 6 / 2 , k even}. For every j G A n , and x G Uj, by Lemma [3 .3} 



q x {B{m)-n 2 ^) > 



cm d 2 



((j + l)n a ) d - 2 ' 



Using (16.31) . the number of walks starting in Uj hitting B(m) has a Poisson distri- 
bution with parameter at least 



dm d 2 



cf d (n)m d ~ 2 > J (j + iy-2 i nn ' 



d 



((j + l)n a ) d ~ 2 ~ j dm d ~ 2 > 



U + l) f 



-2 : 



Using the stability of the Poisson distribution, this yields that the number of walks 
starting in {Jj €An Uj hitting B(m) has a Poisson distribution with mean at least 
c"m d ~ 2 . Claim (I6.12p then follows from the large deviation properties of the Poisson 
distribution again. 

We now apply Proposition 14.21 with m instead of n, G m = B(m) and fj[ m \ fj^ 
being the number of walks entering B(m). Assumptions (14.11) . ( 14. 2 p are satisfied by 
the previous discussion. The construction of X assures that the walks do not stop 
earlier than after making 2m 2 < 2n 2a steps. Therefore, by an argument similar to 
proof of flE2D, for d = 2(2/3(/i, d) + 3) 

P[T " > dm 2 } < P[0 G i](Po[-0i i B(m)} + P^[f n > c'm 2 ,^ G B(m)}). 



Both terms in the parentheses are s.e.(m), the first one by Lemma [6.2[ the second 
one by Proposition 14.21 Taking mp such that dm 2 = £, the lemma follows for for 
t < d{n/2) 2a . 

For the remaining ts it suffices to observe that s.e. (£) < s.e. (n) and apply the same 
reasoning as before with B(n a ) instead of B(m). □ 

We can now control the sum (16.111) . To this end we define p, (y), TJ} in the 
same way as p, tp^d/), using X instead of X. Due to (16.41) . 

P[there exists < k < n : f„ fc ^ f„ fe ] < sz.{n). (6.13) 

Therefore, by the previous lemma, for every k, 

P[f fc n > i] <5.t.(n)+s.t.{£). 

The random variable T£ depends only on the random walks that can enter the 
ball B(kei,n a ). As diam(i?j(t fc )) < 2n 2 ( a+£ ^ by definition, setting b n = 5n 2 ( a+6 \ this 
implies that for every j G {1, . . . , b n } the random variables (Tkb n +j '■ k — 1, . . . , n/b n ) 
are independent. 
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Therefore, for large enough c, using the observation above (16. lip , and (16.131) . 



n 

P^i^CneO) > cn/2] < p[^T fc > cn/2 



k=0 
n 

< P[^T fc > cn/2 

k=0 

n/b 



+ s.t(n) 



+ s.t(n) 



(6.14) 



< P 



3j e [l,b n ] : ^2T kbn+j > cn/(2b r , 



k=0 



< 1 — s.e.(n), 

where for the last inequality we used the fact that a < | and then applied a large de- 
viation bound for random variables without exponential moments (e.g., Theorem 1.1 
of |Nag79| ). 

Combining flBTT4j) . with (163|1 . the inequality (ESJ) follows. 

To conclude the proof of Theorem ll.3[ observe that (16.51) implies that for large 
enough c 

F[3k E [n/2,n] such that ke 1 E Z u and p u (ipi, ke,\) > cn] 
<P[ |J {pu(^uMj^))>cn} 

je[n/2,n] 

< s.z.(n), 

and then, since for any k E [0,n/2] such that kei E X u one can write p u {tpi, kei) < 
Pu{^u ^o( ne i)) + p u (V>ci(nei), tei), we have 

P[3fc G [— n, n] such that ke x E X u and p u (^i, kei) > cn] < s.c.(n). (6.15) 

Observe that, by symmetry, (16.151) also holds if one substitutes ei by any coordinate 
vector ej, j = 2,...,d. The claim of the lemma then follows if one writes, on 
x = (xi, . . . , Xd) E X u and E X", 

Pu(0,x) < p n (0,^ ei) (0)) + Pu (^ ei)(0) ,4 ei) (x 1 e 1 )) 

+ p u (ipo( x i e i)^ ^f 2) xiei) + p u (ip[ e2) x^,^^ (x^ + x 2 e 2 )) 
+ ... 

+ p„(V>o (^i e i H h acd_ie d _i), ^ ed) (a;iei H h 2d-ie d _i)) 



+ p u (^ ed) (a;iei + • - • + Xd-ie d _i), ^'(x)), 

and uses the same reasoning for the even terms in the right-hand side, and bounds 
the odd terms using (16.71) and Proposition 14.21 using the same argument as e.g. 
in (16.101) . This completes the proof of Theorem 11.31 □ 

7. Proof of the shape theorem 



( e d). 



In this section, to prove Theorem 11.11 we use more or less standard argument 
based on the Subadditive Ergodic Theorem. For reader's convenience, let us state 
this theorem here (we use the version of |Lig85| ): 

Theorem 7.1. Suppose that {Y(m, n)} is a collection of positive random variables 
indexed by integers satisfying < m < n such that 
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(i) F(0, n) < Y(0, m) + Y(m, n) for all < m < n; 

(ii) The joint distribution of {Y(m + l,m + k + l),k> 1} is the same as that of 
{Y(m, m + k),k > 1} for each m > 0; 

(iii) For each k > 1 the sequence of random variables {Y(nk, (n+ l)k),n > 1} is 
a stationary ergodic process; 

(iv) EY(0, 1) < oo. 

T/ien, ii ZioWs i/mf 

.. Y(0,n) . EF(0,n) 

lim = mi a.s. 

n— >oo n n>0 n 

We are going to verify the hypotheses of Theorem 17.11 for the sequence of random 
variables 

Y(m,n)=p u (^\^), 

under the measure Pq. First, (i) is obvious since p u is a metric. Stationarity and er- 
godicity in (ii)-(iii) follow from the corresponding properties of X u , see Theorem 2.1 
of [SznlOj . The property (iv) then follows from the estimate 

F%[p u (0,4 x) )>n]<s.t.(n), 

which can be proved by applying the same procedure as in (I6.10p . using Lemma [6T21 
and Proposition 14.21 

Theorem 17.11 implies that for any x G Z d there exists a positive number c' u (x) 
such that 



n 



lim " v "' Yn ' = a' u (x) =1. (7.1) 



Then, define for i^O 

and cr u (0) := 0. With (17. ip it is straightforward to obtain (observe that, according 
to our notations, i/j^\nx) is either nx itself in the case nx G T u , or it is the 'last site 
before nx' on the discrete ray {kx, k > 0} if nx ^ X"), using also the usual Ergodic 
Theorem and ( 16. 7p . that 

,. Pu(0,ipo\nx)) 

lim = aJx) Pn-a.s. (7.3) 

n— >oo 77. 

It is also straightforward to obtain that for any integer m and x G Z d , it holds that 
a u {mx) = ma u (x); this permits us to extend a u to Q d by cr u (x) := m _1 <7 u (m:c), 
where m is such that mi G Z d . Also, it is clear that a u (x) > ||x||i for any x G Q d . 
Next, the goal is to prove that a u is a norm. 

Lemma 7.2. For all x,y G Q d we have 

a u (x + y) < a u (x) + a u (y). (7.4) 

Proof. Abbreviate 6^ = EqQ , from the Ergodic Theorem we obtain 

HV'f-i — nx\\ 

lim — = P"-a.s. (7.5) 

n->oo 7*1 

Since p u is a metric, we have (see Figure [3D 

Mo.<^g„)) < ^jgy , ^.« ) ...< ) ..(< ) ..)) 

n n n 
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FIGURE 3. On the proof of Lemma 



for any n. Then, the trick is to take the limit as n — > oo in (I7.6P in probability. 
First of all, a direct application of fl7.ip - fl7.2p shows that the first term in the 
right-hand side of (17.61) converges to cr u (x), even P^-a.s. Next, under Pq it holds 
that pu^'fX , V'f-i (^f-i )) i s equal to p u (0, ijj^X ) in distribution, so the second 

term in the right-hand side of (17. 6p converges to u u {y) in distribution and hence in 
probability. As for the term in the left-hand side of (17. 6p . write 

Uy it u x lis ^ x + y y x x-\-y / l-rr 

n ~ n n 

Again, the first term in the right-hand side of (17. 7p converges Pg-a.s. to <r u (x + y). 
To obtain that the second term in the right-hand side of (17. 7p converges to in 
probability, observe that 

- ^ V) J IK* + V> ~ <CT \\nx - m || 

Oy n o x n °x+y n ^ x+y _|_ 11 b x n" 



n n n 

l< > '.(<l>-<'-.-'H'l 
H - . (7.8) 

Since the third term in the right-hand side of (I7.8P equals in distribution to rT x \\ny — 
■01-1 II j (I7-5P implies that the left-hand side of (17. 8p converges to in probability, 

and so Theorem 11.31 implies that the second term in the right-hand side of (17. 7p 
converges to in probability. This proves (I7.4p . □ 

Now, Lemma 17.21 shows that a u can be extended to a norm in M. d by continuity, 
and we are able to finish the proof of Theorem 11.11 



Proof of Theorem li.il At this point the argument is quite standard. Let 

D u = {x G R d : a u {x) < 1}. 

Let e' = (1 — e)^ 1 — 1, and e" = 1 — (1 + To prove Theorem II .![ it is enough 
to prove that nD u n X" C A"((l + e')n) and A n ((l - e")n) C nD u for all n large 
enough, Pp-a.s. 
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Since D u is compact, one can find a finite set F := {x\, . . . , Xk} C D u fl Q d such 
that <7 u (xi) < 1 for i = 1, . . . , k, and (with C from Theorem II. 3 j) 

^nr*cU%,c-V). 

i=l 

Consider any Xj G F; let m« be the minimal positive integer such that m^Xj G Z d . 
Let n = jrrii + s, where < s < nti — 1. Then, for all n large enough it holds by (17. 3p 
that ^ miXi) (jrrnxi) G A u (n), P^-a.s. 

Now, Theorem 1 1.3 1 (16. 7p and the Borel-Cantelli lemma imply that Pp-a.s. for all n 
large enough we have 

B(nx h C~ l ne') flTc A u (jmiXi,ne'), 

for all i = 1, 2, . . . , k. So nD u fl X u C A"((l + e')n), which completes the first part 
of the proof. 

For the second, choose G := {yi, . . . , C (2D U \ D u ) fl Q d in such a way that 

2D u \D u c\jB( yi ,e"5). 
i=i 

Notice that cr u (yi) > 1 for i — 1, . . . , k. Again, nG(l A u (n) = for all n large enough 
Pg-a.s. Analogously, by Theorem 11.31 and Borel-Cantelli we get that for all n large 
enough, if A u ((l - e")n) n n(2D u \ D u ) ^ 0, then A u (n) n nG ^ 0. This shows that 
A n ((l — e")n) C nD u for all n large enough, Pp-a.s., and so concludes the proof of 
Theorem 11.11 □ 



8. Random walk on the torus 

It remains to show Theorem 11.61 We recall that T d N denotes the d-dimensional 
discrete torus of size N, P N the law of the simple random walk on started from 
the uniform distribution, and p^(x,y) the internal distance within the set of 
sites visited by the random walk before time uN d , = {X , . . . , X\ uN d\}. 

Let Bn(x, r) C be the ball of radius r around x in the usual distance, djsr, on 
the torus. We first control the internal distance in balls of radius In 7 N. 

Lemma 8.1. Let x G C Z d . Then, for C\, 7 large enough, 

P N [there exist y,ze B N (x, In 7 N) such that p u N (y, z) > c x In 7 N] = o(A~ a! ). (8.1) 

Before proving this lemma, let us explain how it implies Theorem 11.61 

Proof of Theorem \l.b\ By the lemma and a simple union bound, with probability 
tending to 1, the event in (18. ip is satisfied for all x G T^. If this is the case, we can 
chain these boxes to obtain the claim of the theorem. More precisely, consider x, y 
such that d n (x, y) > In 7 N. Then, one can find points x — x±, x 2 , ■ ■ ■ , x n — y such 
that Xi + \ G B(xi,\n / N), i < n, and ^^^(^1)^1+1) < 26n{x, y). As we assume 
that the event in (18.11) is satisfied for all balls, for all i < n, 

p%(xi,x i+1 ) < Cxd^XiiXi+x). 

The theorem then follows using the triangular inequality, setting C = 2c\. □ 
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Proof of Lemma \8. 11 Let r = In 7 N and R = Cr, with C of Theorem 11.31 By Theo- 
rem 1.1 of [TWllj . for any a > 0, there exists a coupling Q of random interlacement 
on Z d and random walk on the torus, such that 

Q[(pQ-*) n B(x,R)) C (l^nB N (x,R)) C (l u{1+£) nB{x,R))] > 1 — N~ a . 

For points that are in Z M ( 1_e ) fl B(x,r) we can use Theorem 11.31 and obtain the 
required statement. For points in \T U ^ 1 ~ £ \ however, this simple argument fails 
and we need more details on the coupling construction. 

The construction starts by splitting the random walk trajectory into so-called ex- 
cursions. These excursions are independent simple random walk trajectories started 
at the boundary of Bn(x, R) and stopped when staying a sufficiently long time out 
of Bjf(x, N l ~ e ), see Section 4 of |TW11] for the precise definition. 

We denote the excursions started before time uN d by . . . ,X^\ where 77 is 
random. These excursions are constructed in such a way that 

r/-l n 

(|jRanX (i) nB{x,RU C (l u N C\ B{x, R)) C ( [j RanX (i) n B(x, R)j . (8.2) 

i=l i=l 

Using Lemma 4.3 of |TWllj . it is easy to see that the random variable r\ satisfies 

P N [(u + e)capB(x,R) > rj > (u - e) c&pB{x, R)} > 1 _ e - cca P B M). (8.3) 

Further, combining Lemmas 3.9, 3.10 of [TWllj . it follows that the distribution of 
the starting points satisfies for every z G dB(x, R) and i = 2, . . . , R 

P [X ° ~ Z] - cap £(*,!?) ' (8 ' 4) 

that is, it is close to the normalised equilibrium measure. The distribution of the 
first excursion cannot be controlled so precisely, as in principle it can start inside of 
B(x, R), but it is not issue for us. 

The proof of Lemma 18.11 is then completely analogous to proof of Theorem 11.31 It 
suffices to observe that the only property of the random interlacement that we used 
in the proof of Theorem II. 31 are the bounds on the number and starting distribution 
of trajectories entering a fixed set. These bounds follow from (18. 3p . (18. 4p . Finally 
one should observe that s.t(R) = o(N~ d ) if 7 is chosen large enough. 

There is a small issue with the fact that the last point of the trajectory, X\ uN a\ , 
might be contained in the last excursion, cf. (18.21) . To solve this issue, observe 
that our techniques apply to the both 'clusters' C := (J^ RanX^ and C : = 
UiLi RanXW. Hence, with probability 1 — s.c.(R), the internal distances on these 
clusters within B(x, r) are bounded by c\r. If this is the case, then the trajectory of 
X^) mus t intersect C at least every (2cj + l)-steps. For any x E B(x, r) nX^) nl]J 
there is thus a path of length at most (2ci + 1) lying inside of which con- 
nects x to C. Lemma f8.ll then follows by triangular inequality, by increasing c\ to 
2(2 Cl + l) + ci. □ 

Appendix A. Domination by Bernoulli percolation 

We sketch here a simple argument proving Theorem 11.31 in d > 5, with 5 = 1. 
This argument is based on the domination of the interlacement set X u in thick two- 
dimensional slabs by the standard Bernoulli percolation. This domination seems to 
be folklore in the random interlacement community, but to our knowledge it does 
not appear in any previous publications. 
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Let K be a sufficiently large constant and e G (0, 1). Let E 2 be the set of nearest- 
neighbour edges of Z 2 , and for every e = (x,y) G E 2 , let 

K 

G e = [J 5(Ax + i(y - x), K £ ) C Z d , 

8=0 

where we standardly identify x = (x\, x 2 ) G Z 2 with (xi, x 2 , 0, . . . , 0) G Z d . G e is a 
thin parallelepiped of length K + 2A £ and width 2K £ along the scaled edge Ke. 

Let W* C W* (recall Section |2] for the notation) be the set of all doubly-infinite 
trajectories modulo time shift that hit G e but not U e ' dist(e e')>i ^ e '' ^ ne ^ ac ^ ^hat 
the co-dimension of Z 2 is larger than 3, that is, the random walk is transient in the 
direction perpendicular to Z 2 , can be used to show that 



P 



oj G Q contains a trajectory with label smaller 
than u that intersects G P but is not in W* 



1. (A.l) 



From Lemma 2 of [RSI lb], it follows that the probability that there is a con- 
nection along the long direction of G e within X u can be made arbitrarily large by 
increasing K. Let C e be the probability that this connection uses only the trajecto- 
ries in W*. Using flA.lj) . it follows that the probability of C e can be made arbitrarily 



large by increasing K too. 

Moreover, Proposition 1 of [R Sllbj (or our Proposition 14. 2p can be used to show 
that for any x G Z 2 C Z d and v, w G X u fl B(Kx, K £ ) there is connection of v and w 
within X u fl B(Kx,2K £ ) with probability tending to 1 as K increases. We denote 
by V x the event that this connection uses only the trajectories in [j eBx W*. Again, 
applying ( lA.lj) and choosing K large, the probability of T> x can be made arbitrarily 
large. 

Finally, call the edge e = (x, y) G E 2 good, when C e fl V x fl V y occur. It follows 
that the probability that e is good can be made arbitrarily large by choosing K 
large. Since W* and W* are disjoint subsets of W* when dist(e,e') > 1, the events 
'e is good' and 'e' is good' are independent when dist(e,e') > 4. Moreover, using 
the events V x , the connections realising C e , C e > in two adjacent good edges e, e' can 
be connected to form one path. 

Using the domination argument of [LSS97J, we see that for K large the good edges 
dominate the supercritical Bernoulli percolation on Z 2 , in particular, there is with 
probability one an infinite cluster C C Z 2 of good edges. Moreover, when x,y G C 
are connected by a path of length I in C, B(xK, K £ ) and B(yK, K £ ) are connected 
by a path of length at most i{(K + 2K £ )(2K £ + l)^ 1 + 2(AK £ + l) d ] within X u 
(the factor in braces is simply the volume of the parallelepiped G e plus volume of 
the two boxes B(x 2K e ), B(y,2K £ )). 

Theorem 1.1 of [AP96J then implies that the claim of Theorem 11.31 holds (with 
5 = 1) for all x G X u fl IJyec B{y, N £ ). The extension to all x G X u is then trivial 
by repeating the argument for other coordinate directions and using Proposition 1 
of |RSllb] or Proposition 14 . 2 1 to made the final connections to those x's that are not 
in the A^-neighbourhood of KC. 
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